Fast in-loop MCQA #281

davidheineman · 2025-05-19T17:21:11Z

Bump in-loop evals to v0.8.1. This adds "fast" MCQA, which performs MC tasks in 1 forward pass instead of 4 (we extract the A/B/C/D logits from a single pass).

allenai/OLMo-in-loop-evals#8

This makes the MC tasks 4x faster and produces the same numbers.

Also adds BPB evals for MBPP translated to Java, Rust, and C++.
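To illustrate why a single pass can give the same numbers, here is a minimal sketch that compares the two scoring strategies. It is not the `ai2-olmo-eval` implementation: `ToyCausalLM`, `VOCAB`, `score_labels_slow`, and `score_labels_fast` are hypothetical names, the "model" is a deterministic toy, and the sketch assumes each answer label (A/B/C/D) is a single token.

```python
import math

# Hypothetical toy vocabulary; a real tokenizer's ids will differ.
VOCAB = ["A", "B", "C", "D", "Question", "Answer", ":"]
LABEL_IDS = [VOCAB.index(label) for label in "ABCD"]


class ToyCausalLM:
    """Stand-in for a causal LM that counts its forward passes."""

    def __init__(self):
        self.forward_calls = 0

    def forward(self, token_ids):
        """Return next-token logits for every position in token_ids."""
        self.forward_calls += 1
        logits = []
        prefix_hash = 0
        for position, token_id in enumerate(token_ids):
            # Causal: logits at a position depend only on tokens up to it.
            prefix_hash += (position + 1) * (token_id + 1)
            logits.append([math.sin(prefix_hash + v) for v in range(len(VOCAB))])
        return logits


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    peak = max(logits)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return [x - log_norm for x in logits]


def score_labels_slow(model, prompt_ids):
    """One forward pass per answer label (4 passes for A/B/C/D)."""
    scores = []
    for label_id in LABEL_IDS:
        logits = model.forward(prompt_ids + [label_id])
        # The label's log-prob is read at the last prompt position.
        scores.append(log_softmax(logits[len(prompt_ids) - 1])[label_id])
    return scores


def score_labels_fast(model, prompt_ids):
    """Single forward pass over the prompt; read all four label logits."""
    log_probs = log_softmax(model.forward(prompt_ids)[-1])
    return [log_probs[label_id] for label_id in LABEL_IDS]


prompt = [VOCAB.index(t) for t in ["Question", ":", "Answer", ":"]]
slow_model, fast_model = ToyCausalLM(), ToyCausalLM()
slow_scores = score_labels_slow(slow_model, prompt)
fast_scores = score_labels_fast(fast_model, prompt)
```

Because the model is causal, the logits at the last prompt position do not depend on which label is appended, so both functions read the same log-probabilities; the slow path simply recomputes them once per label. In this toy setup `slow_model.forward_calls` is 4 and `fast_model.forward_calls` is 1.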


epwalsh

Sweet!

Incorporate this one-line PR: allenai/OLMo-in-loop-evals#12. TL;DR: #281 made in-loop RC and BPB evals slower; this fixes that bug. **The RC/BPB in-loop evals run with `ai2-olmo-eval~=0.8.0` are correct, just slower.**


davidheineman added 4 commits May 16, 2025 21:22

add fast mc configs

e6fb579

in-loop 0.8.1

9a3b023

add fast mc configs

32c0391

Merge branch 'fast-mc' of https://github.com/allenai/OLMo-core into fast-mc

529e3a7

davidheineman self-assigned this May 19, 2025

update changelog

675fe7e

davidheineman requested a review from epwalsh May 19, 2025 17:27

epwalsh approved these changes May 19, 2025


epwalsh merged commit 776778e into main May 19, 2025
15 checks passed

epwalsh deleted the fast-mc branch May 19, 2025 17:45

davidheineman mentioned this pull request May 27, 2025

Bump ai2-olmo-eval==0.8.3 (RC/BPB speed fix) #285

Merged
